Intonational Features of Local and Global Discourse Structure

Authors

  • Julia Hirschberg
  • Barbara J. Grosz
Abstract

We present results of a study of the relationship between intonational features, including pitch range, timing, and amplitude, and aspects of discourse structure defined in terms of Grosz and Sidner's (1986) model of discourse. We compare structural labelings of AP news text with prosodic/acoustic features measured from recordings of the same text read by a professional newscaster. We find significant correlations between prosodic/acoustic characteristics and both local and global aspects of discourse structure identified by our labelers. Our results have applications for speech synthesis and, potentially, for speech recognition.

The hypothesis that discourse structure is signalled by variation in intonational features such as pitch range, timing, and amplitude has been examined in studies such as [1, 2, 3, 4, 5, 6, 7]. However, as Brown and her colleagues note [2, p. 27]: "... until an independent theory of topic-structure is formulated, much of our argument in this area is in danger of circularity." In this paper we examine the relationship between discourse structure and variation in intonational features using just such an independent model of discourse structure, that proposed by Grosz and Sidner [8] (G&S). We present results of an empirical study comparing intonational features of read text with elements of both the local and global structure of discourse. Our study has immediate application to the generation of appropriate intonational features for synthetic speech, and future applicability to the recognition of discourse structure in speech recognition tasks.

Our corpus consisted of AP news stories recorded by a professional speaker. The intonational features we considered included pitch range, contour, timing, and amplitude. The discourse structural elements we examined at the local level included parentheticals, quotations, tags, and indirect reported speech; at the global level, we studied discourse segmentation, that is, the division of a discourse into constituents that provide the basis for determining discourse meaning. The discourses were labeled by two groups: one group labeled from text; the other group labeled from text while listening to the recorded speech. In this paper, we describe similarities and differences in the segmentations elicited in these two conditions.

Our experiments provide support for three hypotheses. First, instructions can be devised, based on the G&S model, that enable subjects to analyze discourses with considerable similarity. Second, discourse structure is marked intonationally, although the relationship between structure and intonational features is a complex one; a given discourse structural feature may be signaled by several intonational features, either …
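The comparison between the two labeling conditions (text only versus text plus speech) can be illustrated with a minimal sketch. The excerpt does not state how similarity between segmentations was measured, so simple percent agreement on segment boundaries is used here purely as an assumption; the function name `boundary_agreement` and the data are hypothetical, not taken from the study.

```python
def boundary_agreement(labels_a, labels_b):
    """Fraction of phrases on which two labelings agree (boundary vs. no boundary).

    Percent agreement is an illustrative choice, not the measure used in the paper.
    """
    if len(labels_a) != len(labels_b):
        raise ValueError("labelings must cover the same phrases")
    if not labels_a:
        raise ValueError("labelings must be non-empty")
    matches = sum(a == b for a, b in zip(labels_a, labels_b))
    return matches / len(labels_a)


# Hypothetical data: True marks a phrase judged to begin a new discourse segment.
text_only = [True, False, False, True, False, False, True, False]
text_and_speech = [True, False, True, True, False, False, False, False]

agreement = boundary_agreement(text_only, text_and_speech)
print(f"agreement: {agreement:.2f}")
```

Percent agreement does not correct for agreement expected by chance, so a chance-corrected statistic may be preferable when boundaries are rare.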


Similar resources

Some intonational characteristics of discourse structure

This paper reports on a study of the relationship between acoustic-prosodic variation and discourse structure, as determined from an independent model of discourse. We present results of two pilot studies. Our corpus consisted of three AP news stories recorded by a professional speaker. Discourse structure was labeled by subjects either from text alone or from text (with all orthographic marki...


The Prosody of Discourse Structure and Content in the Production of Persian EFL Learners

The present research addressed the prosodic realization of global and local text structure and content in the spoken discourse data produced by Persian EFL learners. Two newspaper articles were analyzed using Rhetorical Structure Theory. Based on these analyses, the global structure in terms of hierarchical level, the local structure in terms of the relative importance of text segments and the ...


Syntax-prosody Interface for Romanian within Information Structure Theories

The following main ideas have been pointed out and put to work within our paper: (a) Information Structure (IS) theories (topic-focus, theme-rheme, background-contrast, informational and contrastive focus, focus projection rules, etc.) on text are shown to behave currently as a consistent linguistic tool that can stand behind a correct, language and contextual-depending, mapping of text into sp...


Discovering the Sounds of Discourse Structure Extended Abstract

It is widely accepted that discourses are composed of segments and that the recognition of segment boundaries is essential to a determination of discourse meaning (Grosz and Sidner, 1986). Written language has orthographic cues such as section headings, paragraph boundaries, and punctuation which can assist in identifying discourse structure. In spoken language, intonational variation provides...


Assigning Intonational Features in Synthesized Spoken Directions

Speakers convey much of the information hearers use to interpret discourse by varying prosodic features such as phrasing, pitch accent placement, tune, and pitch range. The ability to emulate such variation is crucial to effective (synthetic) speech generation. While text-to-speech synthesis must rely primarily upon structural information to determine appropriate intonational features, speech syn...




Publication date: 1992